CDS
Accession Number | TCMCG075C20530 |
gbkey | CDS |
Protein Id | XP_017978654.1 |
Location | complement(join(24761930..24765591,24765900..24766068,24766402..24766515)) |
Gene | LOC18597028 |
GeneID | 18597028 |
Organism | Theobroma cacao |
Protein
Length | 1314aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA341501 |
db_source | XM_018123165.1 |
Definition | PREDICTED: uncharacterized protein LOC18597028 [Theobroma cacao] |
EGGNOG-MAPPER Annotation
Sequence
CDS: ATGTTGGAGAAGATCGGTTTGCCCACAAAGCCGTCGTTGAGAGGGAATAATTGGGTGGATGATGCTTCACATTGCCAAGGATGTTCTTCTCAGTTCACCTTCATCAATCGGAAGCATCACTGCCGAAGGTGTGGGGGCCTGTTTTGCAATAGTTGCACACAGCAAAGAATGGTCTTGCGTGGACAGGGTGATTCTCCTGTGCGTATTTGTGAACCCTGTAAAAAGCTAGAAGAGGCTGCGCGTTTTGAGTTGCGCCATGGATACAAAAGTAGAGCTGGTAGAGGTAGTTTGAAACCGGCTGCAAAAGATGAGGATGACATTTTGAACCAGATTCTAGGTGCCGACAGAAAGGAATCCTCTTCGTCAGGAGTAGCTTCCAACAAGGACATGAACCCTAGTGTTCGAAGGGCAGCTAGCAGTTCTTCATATTCAAATGTTCAAGCGGGTGTCAGTCATGATGGGGGAGGAGAAATATGTCGAAGTCAGTCTGTTGATCAGCCTATGCAAAACGATATGGCATCGTCCAGTCCCGAGGAGTTGCGCCAGCAAGCATTGGACGAAAAAAGGAAGTATAAAATTCTAAAAGGAGAAGGGAAATCTGAGGAAGCTCTAAGAGCCTTTAAGAGAGGGAAGGAGCTTGAGAGGCAGGCTGAGTCTTTGGAGATATATATAAGAAAAAACCGTAAAAAAGGTTTGCCATCTGGCAACATGTCCGAGATCCAGAATAAAGATGCGCCTAAAGAATCTGGCAGAAAAAGTAAGGTCCCTCATCAAGTGGGTAGGGATAAGGATGACCTGGCTGCTGAACTCAGGGAATTGGGGTGGTCTGATATGGATCTACATGACACTGATAAAAAATCCACAAATATGAGTTTGGAGGGTGAGCTCTCTTCTCTTCTTGGAGATATTCCTAAGAAGACTAATGCTCATGGCACTGATAAAACCCAGGTTGTTGCCATTAAGAAAAAGGCTCTTATGTTGAAACGTGAAGGGAAGCTTGCAGAAGCAAAGGAAGAACTTAAGAGAGCTAAAGTCTTAGAGAAGCAACTCGAAGAACAAGAAGTCTTGGCTGGAGCTGAAGATTCTGATGATGAGCTATCTGCAATAATCCATAGTATGGACGATGATAAACAAGATGAAATGTTAATTCAGTATGAGGATACTGATGACTTGGACTTTGATCACCTTGTGGGAACTGCTGATGATCTTGGTATTGATAGTAATTTTGAACTAACTGATAAGGATATGGAGGATCCAGAAATAGCTGCTGCTCTGAAATCACTAGGTTGGACCGAGGATTCTAACCCCACTGAAGATCTTGTGGCTCAGTCTGCTCCTGTTAATAGGGAGGCACTAGTTAGTGAAATTCTTTCATTAAAAAGAGAAGCTCTTAGTCAAAAGCAGGCAGGTAATGTTGCGGAGGCAATGGCTCAGTTAAAGAAGGCAAAGCTACTTGAGAAGGACCTTGAAAGCTTCGGTTGTCAAGCGGAGAATTTGACAGTGAATAAAAATGACCCAACTCCTCACACTTCTGACATATCAGTGAAGTCAGTTAAGTTGGGTGATGAAAATGTTAATGCTATTAAAGATGTGGATGTGAAACCTGCACCAAAGAGTGGATTGATGATTCAGAAAGAGCTTCTGGGATTGAAGAAGAAAGCCCTTGCTTTGAGAAGGGAAGGAAGATTGGATGAAGCAGAGGAAGAATTGAAGAAAGGCAAGATTCTTGAGCGCCAGCTTGAAGAAATGGAGAATACTTCAAACATGAAGGCTGCACAGGTACCTATCGGCAGTAAGGGTAAGGATATGATAATTGAGCATCCTTATGTATTAGAAAATCTGACGGTTGAAGGAGGAGATGTTACAGATCAAGACATGCATGACCCGACATACCTTTCAACCCTAAGGAACTTAGGTTGGAATGACAATGATGATGAGCGTTCAAACTCTTTGCTGAAACATTCTAAGCAAAAAGATTCTGAGCAAATTATTGAATCTTCTTTGACTTGTGCCCCTCCTAAAACCCCAGCCAAGGCATCAAGAAGAACTAAAGCTGAAATACAGAGGGAGTTATTAGGCTTGAAAAGGAAAGCTCTTTCTCTGAGGCGCCAAGGAAATACTGATGAGGCAGAGGAAGTGCTGGAAACAGCAAAAACATTGGAGGCTGAGATAGCAGAGATGGAGGCACCAAAGAAAGTGGTGGAATCGAACTGGCCTAACGAAAAAGCCATGTTGCCTCCCCTTAATAGTGCTGCGCAAGAAGCAGATGATGAGAATGTTACAGAGAAGGATATGAATGATCCAGCTCTGCTCTCAGTGCTAAAGAATTTGGGTTGGAAGGATGAAGAGCTTGAACATGCAACTATGCAAGAAAAGTACTCAAAAAGTGCTCGTGAGTCTTTACATTCTGGCCATCCATCTGTCTCTCAACCCTCTTCAGGAATTTCAGTTTCGCTGCCAAGAAGTAAAGGGGAAATCCAAAGAGAACTTCTGGGTTTGAAAAGAAAGGCTCTTGCCCTTCGACGAAATGGTCAAGCTGAAGAGGCTGAGGAGTTGTTGCAAAGGGCAAAGGTACTGGAAGCTGAAATGGCAGAATTGGAAGTTCCAAAAGGTGAGATTGTGCTTGATTCATCCAAGGACAGTACATCTGGGAACTCTGAATCATTTACTAATCAGGGAAGGCAAGGGAATTTAAAAAATGAAATGACATTAAAGGAAGGGCCAGTTGCAGTGGCAGTGGGTCCAAGTGAAACAGTCGTAGGATCATCAATCGGTTTAGGAAGAATGGAGAGCGATACAGATAATCCTACCCTGAGGAATTCCGAGCTGTTATTTCCTGCAGCCACCGGGCCACTAGAAGACAAGAAATCCTCATTTGAAAAATCAGATCCCTCAGGTGCAATGGGACTTCTAGGTGGTAAGGGAAAAGTTGAAACTGCTAGTTTTGTCTCTCCACCTGACCAGTCTGCAAACATAGTGGATTTGTTGACTGGCGATGACCTAATTAGTTCTCAGATACTAGCTGAGAAATTGAAAGAGAAAAGTGATTTTGGTTCCAACTTCTCTTCTCTTGCTAGACCGAATGTTCAGTTGGCTTCCCAAGAAGATCTTAGAACCAAGGATGAAGATACTACTGGAATAAGTAGAGTGGTTAATGGAGAGCAGAAGCCACATGCGTTTGATGTGAGTCCAGTTCAGGGATTTGTTTCTCATAACAGCCAAGATTCACTTAAGCAAGCAGTTTTGTCTCACAAGAAGAAGGCACTTGCTTTGAAGAGAGATGGAAAATTGGCAGAAGCTCGGGAAGAACTTCGGCAGGCAAAGCTGTTGGAGAAGAGTCTGGCAGAAGATAGCACTCCATCAAAAGGTGGTGCAAATGGTGCATCAACATCTTCATCCACTGTTCCCTCTGATGCACCAAAGGAGCAGGGTGCATCAAGTTTAGCTCCAAAACCACTGTCGGGGCGTGATCGCTTCAAGTTGCAACAGGAATCCCTCAGTCATAAGCGCCAGGCTTTGAAGCTACGAAGAGAAGGCCGGATGCAAGAAGCAGAAGCTGAGTTTGAAATGGCCAAGTCTCTTGAAGCCCAGTTGGAAGAGTTGGCTGGTCATGATTCAAGTAAGTCTTCTACCGTAGGGGCAGAACCAGTAGATGATGTAGGTGTTGAAGATCTTCTCGATCCTCAACTTTTGTCTGCCCTGAAAGCAATTGGTTTGGATGATTTAAGTGTTGTCGCTCGAGGCCCAGAAAGAACAGAGCCCGTAAAACCCAATGGTTCCAAAAGTGAAAAGGTTGACCAAGAGAGAATCCAATTGGAAGAGCGGATCAAGGCAGAAAAGTTGAAGGCAGTAAACTTGAAAAGGTCGGGCAAACAAGCTGAGGCTTTGGATGCTCTTCGGAGGGCCAAAATGCTGGAGAAAAAGCTCAATTCCTTGTCTTCATAA |
Protein: MLEKIGLPTKPSLRGNNWVDDASHCQGCSSQFTFINRKHHCRRCGGLFCNSCTQQRMVLRGQGDSPVRICEPCKKLEEAARFELRHGYKSRAGRGSLKPAAKDEDDILNQILGADRKESSSSGVASNKDMNPSVRRAASSSSYSNVQAGVSHDGGGEICRSQSVDQPMQNDMASSSPEELRQQALDEKRKYKILKGEGKSEEALRAFKRGKELERQAESLEIYIRKNRKKGLPSGNMSEIQNKDAPKESGRKSKVPHQVGRDKDDLAAELRELGWSDMDLHDTDKKSTNMSLEGELSSLLGDIPKKTNAHGTDKTQVVAIKKKALMLKREGKLAEAKEELKRAKVLEKQLEEQEVLAGAEDSDDELSAIIHSMDDDKQDEMLIQYEDTDDLDFDHLVGTADDLGIDSNFELTDKDMEDPEIAAALKSLGWTEDSNPTEDLVAQSAPVNREALVSEILSLKREALSQKQAGNVAEAMAQLKKAKLLEKDLESFGCQAENLTVNKNDPTPHTSDISVKSVKLGDENVNAIKDVDVKPAPKSGLMIQKELLGLKKKALALRREGRLDEAEEELKKGKILERQLEEMENTSNMKAAQVPIGSKGKDMIIEHPYVLENLTVEGGDVTDQDMHDPTYLSTLRNLGWNDNDDERSNSLLKHSKQKDSEQIIESSLTCAPPKTPAKASRRTKAEIQRELLGLKRKALSLRRQGNTDEAEEVLETAKTLEAEIAEMEAPKKVVESNWPNEKAMLPPLNSAAQEADDENVTEKDMNDPALLSVLKNLGWKDEELEHATMQEKYSKSARESLHSGHPSVSQPSSGISVSLPRSKGEIQRELLGLKRKALALRRNGQAEEAEELLQRAKVLEAEMAELEVPKGEIVLDSSKDSTSGNSESFTNQGRQGNLKNEMTLKEGPVAVAVGPSETVVGSSIGLGRMESDTDNPTLRNSELLFPAATGPLEDKKSSFEKSDPSGAMGLLGGKGKVETASFVSPPDQSANIVDLLTGDDLISSQILAEKLKEKSDFGSNFSSLARPNVQLASQEDLRTKDEDTTGISRVVNGEQKPHAFDVSPVQGFVSHNSQDSLKQAVLSHKKKALALKRDGKLAEAREELRQAKLLEKSLAEDSTPSKGGANGASTSSSTVPSDAPKEQGASSLAPKPLSGRDRFKLQQESLSHKRQALKLRREGRMQEAEAEFEMAKSLEAQLEELAGHDSSKSSTVGAEPVDDVGVEDLLDPQLLSALKAIGLDDLSVVARGPERTEPVKPNGSKSEKVDQERIQLEERIKAEKLKAVNLKRSGKQAEALDALRRAKMLEKKLNSLSS |